2024-12-16 10:54:36.AIbase.14.0k
Alibaba Tongyi Laboratory Voice Generation Model CosyVoice Upgraded to Version 2.0
The Alibaba Tongyi Laboratory voice team announced that its open-source voice generation model CosyVoice has been upgraded to version 2.0. This upgrade marks significant progress in voice generation technology in terms of accuracy, stability, and natural experience. CosyVoice 2.0 achieves bidirectional streaming voice synthesis through the integration of offline and streaming modeling techniques, allowing for a first-package synthesis delay of up to 150ms, thereby significantly improving the responsiveness of voice synthesis.